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The advent of electronic computers has revolutionised the application of statistical me- 
chanics to the liquid state. Computers have permitted, for example, the calculation of the 
phase diagram of water and ice and the folding of proteins. The behaviour of alkanes ad- 
sorbed in zeolites, the formation of liquid crystal phases and the process of nucleation. 
Computer simulations provide, on one hand, new insights into the physical processes in 
action, and on the other, quantitative results of greater and greater precision. Insights into 
physical processes facilitate the reductionist agenda of physics, whilst large scale simula- 
tions bring out emergent features that are inherent (although far from obvious) in complex 
systems consisting of many bodies. It is safe to say that computer simulations are now 
an indispensable tool for both the theorist and the experimentalist, and in the future their 
usefulness will only increase. 

This chapter presents a selective review of some of the incredible advances in condensed 
matter physics that could only have been achieved with the use of computers. 

1 Introduction 

Most mechanical problems that have an element of realism are analytically intractable; a 
famous example being the motion of three or more mutually attractive bodies. However, 
almost all of these problems are suseptible to computation, leading to approximate solutions 
of arbitrary accuracy. Likewise, very few problems in statistical mechanics are exactly 
solvable [1]. It is because of this that computational methods are so invaluable. 

2 Statistical mechanics of liquids before computers 

The study of liquids is a field that has benefitted greatly from advances in computational 
power. Initial theoretical approaches to the study of liquids did not meet with the same level 
of progress as did the study of weakly interacting gasses and crystalline solids. A great part 
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of the problem lies in the fact that there is no simple, analytically tractable, idealised model 
to start from. Liquids incorporate some configurational structure, as demonstrated by their 
radial distribution functions, but they also have a dynamical aspect typified by gasses, i.e. 
diffusion. Thus the treatment of liquids as either high temperature solids, or as dense gasses, 
met with little success. 

Other factors have helped to stymie an analytical development of the theory of liquids, 
such as the failure [2] of the, reasonable sounding, Kirkwood superposition approximation 
[3]. This approximation treated a three-body correlation as being the result of three two- 
body correlations: 

g g)( ri ,r 2 , r 3)= g g ) ( ri ,r 2 ) g ^(r 2 , r3 ) g g ) ( r 3,r 1 ), (D 

in) 

where g N is the n-particle distribution function. This approximation was central to the 
Born, Bogoliubov, Green, Kirkwood, and Yvon (BBGKY) hierarchy of integral equations. 
The BBGKY relations are exact, but closed solutions could now only be obtained for pair- 
wise additive systems. However, the importance of three-body forces on the gas-liquid 
equilibrium has been highlighted by Anta et al. [4]. Also, in a description of binary liquids, 
the use of accurate two-body potentials must be supplemented by three-body terms in order 
to obtain realistic results [5]. 



2.1 Liquids as a dense gas 

For an ideal gas one has the famous 'ideal gas law', 

p _ Nk B T p 
V 0' 



(2) 



where P is the pressure, N is the number of molecules, fee is the Boltzmann constant, T is 
the temperature, and V is the volume, p is the number density, N/V, and (3 is defined to 
be 1/HbT. In an attempt to describe a liquid as non-ideal gas one can expand the ideal gas 
expression as 

P=^( 1 + ^> + ^ + ...). (3) 

where B n (T) is known as the n th virial coefficient (such an expression can be justified via 
a diagrammatic Mayer expansion of the grand partition function in irreducible clusters [6]). 
With a knowledge of the virial coefficients one is able to then calculate the fugacity, internal 
energy, enthalpy, molar heat, entropy, and the Joule-Thompson coefficients [7]. 
The second virial coefficient is given by 

B 2 {T) = ~J f(r)dr, (4) 

and the third is given by 

Bs(T) = ~\f f f(r)f(r')f(\r - r'\)drdr', (5) 
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Figure 1: Hard sphere equation of state: the solid line Carnahan-Starling equation of state, 
the dashed line is the virial expansion up to B4. 

where f(r) is the Mayer /-function, defined as f(r) = exp[— f3<&(r)] — 1, and <I>(r) is 
the interaction potential. As can be seen, the expressions for the virial coefficients become 
increasingly involved. For example, for one of the simplest systems, the three dimensional 
hard sphere fluid, only B2, B% and B4 have been derived analytically [8,9], and numerically 
intensive computations have had to be performed for B$ and beyond, (for example, Bio 
requires the evaluation of 4,980,756 Ree-Hoover diagrams [10]). In Fig. 1 the equation 
of state derived using the virial coefficients up to B4 is compared to the almost 'exact' 
Carnahan-Starling equation of state [11]. It can be seen that the curves start to diverge at 
packing fractions as low as rj = 0.2 (rj = ^pd 3 , d is the hard sphere diameter. The fluid- 
solid transition is located at rj ps 0.49). Thus the virial equation approach is only really 
suitable for dilute systems and gasses. 

In view of this virial expansions for liquids should be treated with caution; for three 
dimensional hard spheres the radius of convergence of the virial series is still unknown 
even after years of study. As yet the virial expansion is unable to reproduce the freezing 
transition found in simulation studies [12, 13]. All said and done it should be borne in mind 
that the virial expansion is of empirical origin. 

2.2 Liquids as a disordered solid 

The Lennard-Jones-Devonshire cell model [14-16] was based on the idea that molecules are 
trapped in a lattice of cages, each molecule having its own fixed cage within which it can 
move freely. At low temperatures this cage, or well, has the parabolic form characteristic 
of high temperature solids. This well becomes increasingly an-harmonic as the temperature 
is increased. The problem with this model is that the positions of the atoms in a fluid are 
correlated, leading to a dynamical feedback in which each of the atoms forming the cage 
has its own cage. Treating the cage as being immobile only works as a first approximation. 
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Cell theory is at its best near the triple point when the molecules are highly confined [17], 
its usefulness declining as one approaches the region of the critical point, p c [18]. 

The (Lenz-)Ising model is another solid-like model [19,20] in that it is based on a fixed 
regular lattice. Good use was made of this model in the study of critical points, thanks to the 
mathematical tour de force performed by Lars Onsager, who found an analytical solution 
to the 2-dimensional case [21]. However, it has recently been shown that an analytical 
solution to the 3-dimensional case is intractable [22]. Ising models consist of a series of 
lattice points. In the original implementation the lattice points were occupied by either 
spin-up or spin-down sites to represent magnets, in order to study ferromagnetism. The so 
called 'lattice gas' of Lee and Yang was an adaptation of the Ising model, where the spins 
up and down were replaced by occupied or un-occupied sites [23,24]. The bond-fluctuation 
model, also based on a lattice, has been very fruitful in the study of polymer melts [25,26]. 
Another lattice model is that of Lebwhol and Lasher [27] who studied the Maier-Saupe 
mean field model of the isotropic-nematic transition [28,29]. However, given the analytical 
intractability of these models, useful solutions must be obtained via computer simulation 
studies. 

3 The advent of electronic computers 

From the very outset, electronic computers have been applied to the study of statistical me- 
chanics. Perhaps the paterfamilias of modern computers is the 'Mathematical Analyzer, 
Numerical Integrator, and Computer' (MANIAC-I, 1952-1956) built by Nicholas Metropo- 
lis's Theoretical Division at the Los Alamos National Laboratory [30,31]. 

MANIAC was used extensively by Marshall and Arianna Rosenbluth. Their work in- 
cluded Monte Carlo simulations of two-dimensional Lennard- Jones particles and the equa- 
tion of state for three dimensional hard spheres [32] (also examined by Alder, Frankel and 
Lewison [33]) and polymer chains [34] as well as the incredibly important Metropolis al- 
gorithm [35]. MANIAC-I was also used to study ergodicity by Fermi, Pasta and Ulam [36], 
leading to the long-standing 'FPU' problem that adopts their initials [37]. 

In 1957, using an IBM 704 computer, Wood and Jacobson recalculated the hard sphere 
equation of state [12, 38], finding agreement with the molecular dynamics work of Alder 
and Wainwright [13]. In the same year Wood and Parker also undertook a study of three 
dimensional Lennard- Jones particles [39]. These publications opened the door to the study 
of liquids from an atomistic/molecular perspective. It is for this reason that Temperley wrote 
"A key year for liquid state physics was 1957" [40]. For personal perspectives on the early 
history of molecular dynamics and Monte Carlo written by some of the main protagonists, 
see articles written by Wood [41] and by Rosenbluth [42]. 

3.1 Monte Carlo and molecular dynamics 

The two principal techniques that are the mainstay of applied statistical mechanics of liquids 
are Monte Carlo integration (MC) and molecular dynamics (MD) simulations. Stochastic 
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methods have a long history. However, MC in its present form has its roots in the work of 
Nicholas Metropolis and Stanislaw Ulam [43,44], along with the Metropolis importance 
sampling algorithm developed by Marshall Rosenbluth (this paper having over 9300 cita- 
tions at the time of writing) [35]. Molecular dynamics was developed a little later by Bernie 
Alder along with Tom Wainwright [13]. 

Classical molecular dynamics plays out the trajectories of the particles. 

dq t ({q(t)},{p(t)}) ,~ 
, (o) 



dt dt 
d Pi (t) d Pi ({q(t)},{p(t)}) 



(7) 



dt dt 

where q±, q^ are the positions of the atoms, and p\, ...,pn are the respective momenta 
for the N degrees of freedom. One of the most famous algorithms for solving these equa- 
tions is the Verlet scheme [45] 

Ti(t + 8t) = 2r { (t) - n(t - St) + 5t 2 , (8) 

TTij 

where 5t is the so called 'time-step'. The time step is governed by the time scale of the 
fastest motions in the system (for example, to about l/10 th of the bond stretching frequency 
of the lightest atoms in a simulation of a molecule). 

The time average of a particular observable of the system, O is defined as 

O T ({q (t)},{P0(t)}) = ^ f Q 0({q(t)},{p(t)}) dt. (9) 

These two techniques have both their advantages and disadvantages. For example, with 
MC it is very difficult to obtain dynamical information. With MD phase space exploration 
takes much longer. Monte Carlo is usually performed in the canonical ensemble, leading to 
p(N, V, T) and U(N, V, T). Molecular dynamics in the micro-canonical ensemble, giving 
p(N, V, U) and T(N, V, U). 

The Ergodic hypothesis [46,47] essentially states that an ensemble average (MC) of an 
observable, (O) is equivalent to the time average, Ot of an observable (MD). i.e. 

T lim T ({q (t)},{po(t)}) = {0) lx . (10) 

A restatement of the ergodic hypothesis is to say that all allowed states are equally prob- 
able. With this in mind one chooses the computational technique most suited to type of 
information one wishes to obtain. 



3.2 Random number generators 

At the heart of any Monte Carlo computer code lies a pseudo-random number generator 
(RNG). The generator is pseudo since the series of numbers originate from a deterministic 
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computer code. One of the first RNG's was the so called 'linear congruential generator' 
(LCG), proposed by D. H. Lehmer in 1951 [48] The endearing features of this algorithm 
is that it is simple, easy to program, portable across computing platforms and fast. This 
Lehmer algorithm has come to be known as a minimal standard against which many new 
algorithms are compared with. A good number of Monte Carlo integration computer codes 
use the Lehmer algorithm (or a combination thereof [49,50]) to provide the pseudo-random 
numbers they require. The Lehmer algorithm can be written as 

y n+1 = ay n + b (mod to), (11) 

where the user chooses a, b, to, and a seed value to initiate the algorithm, dq. A very 
popular implementation of the LCG is the prime modulus multiplicative linear congruential 
generator (PMMLCG) [51]. The parameter to should be prime and as large as possible 
without causing a numerical overflow on the computer that it is running on. For example, 
for a 32-bit (31 bit + 1 sign bit) word size then the logical choice of to is the Mersenne 
prime 

m = 2 31 - 1 = 2147483647, (12) 

with a = 7 5 (a positive primitive root of to [52, 53]), and 6 = With these parameters one 
is able to generate a series of 2.147 x 10 9 pseudo-random numbers from one seed value. 
For an interesting discussion on how to choose an initial seed value see [54]. For a list of 
other values of to and a see Ref. [55] and for its use on 64-bit computers see [56]. 

Not all random number generators are equally good. [51,57,58], Some generators were, 
and actually are, quite bad (such as the infamous RANDU [51]). Finding a suitable RNG 
for a given application is a very important task. 

Given the extensive use of the Monte Carlo technique in elementary particle physics 
calculations Martin Luscher developed a RNG called RANLUX [59, 60] (available from 
the CPC Program Library [61]) based on the RCARRY algorithm proposed by Marsaglia 
and Zaman [62]. Luscher describes his algorithm as being a 'discrete approximation 
to a chaotic dynamical system'. The RANLUX routine has a period of an incredible 
« 1 x 10 171 random numbers. 



3.3 Liquids as a liquid 
3.3.1 Perturbation theory 

Perturbation theories are a very useful tool in many areas of physics. The idea is to take a 
known 'simple' system as the reference system (for example, the hard sphere fluid [63] is 
an ideal reference system for the study of, for example, liquid argon, as demonstrated by 
Zwanzig [64]) and then treat, say, attractive forces as a perturbation of the reference system. 
A classic example of a perturbation theory is the Van der Waals model [65, 66]. Another 
famous example is that of Barker and Henderson, who used the hard sphere fluid as the 
reference potential in a study of the square well system [67]. They also examined the 6:12 
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potential [68] using the Zwanzig formulation. Weeks, Chandler and Andersen divided the 
Lennard -Jones potential into repulsive and attractive sections [69-71]. These approaches 
have worked well because repulsive forces are very important in fluid phase [70], although 
naturally, without attractive forces there is no gas-liquid transition. 

For 'polymers' Wertheim developed the 'first order thermodynamic perturbation theory' 
(TPT1) [72-76], also known as the 'self associating fluid theory' (SAFT) [77-79]. TPT1 
allows one to calculate the properties of a fluid chains simply from a knowledge of the 
monomer fluid, 



where Z mon is the reference equation of state for the monomer system, and m is the number 
of monomers in the chain. For example, if one wishes to study the equation of state of a fluid 
composed of Lennard- Jones chains one can do so easily if one has access to the equation 
of state of the Lennard- Jones monomer fluid. It is interesting to note that in this case that 
the TPT1 theory, originally developed for the fluid phase, has been successfully transfered 
to the solid phase [80-83]. 

Needless to say, perturbation theories intimately depend on the quality of the data for 
the reference system, which is almost always obtained from computer simulation data. 

3.3.2 Integral equations 

In the treatment of a liquid as a liquid the Ornstein-Zernike relation [84] is a shining (or 
maybe one should say 'opalescent') example. The Ornstein-Zernike relation was born of a 
study of the enfant terrible of the liquid state; a singularity known as the the critical point, 
found at the very end of the liquid-gas transition line. 

For a homogeneous and isotropic fluid the Ornstein-Zernike relation is given by 



where h{r) is the total correlation function (which is g(r) less the mean field contribution 
of 1). This relation defines the new function, c(r), known as the direct correlation function. 
This relation expresses the notion that the 'total' correlation between atoms 1 and 2 at a dis- 
tance r is given by the 'direct' correlation between 1 and 2, along with indirect correlations 
due to contributions from the rest of the atoms in the fluid, i.e. the influence of atom 3 on 
atom 2 due to the influence of atom 1 on atom 3 etc. This can be seen by expanding the 
above expression: 




(13) 




(14) 



h(r) = c(r) + 



+ 



+ 




+ 



(15) 
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Obviously, a fluid, because of its nature, is composed of a large number of atoms. This 
makes the aforementioned exact relation unwieldy and impracticable to say the least. For 
this under-determined equation to be useful one must look for a closure relation. Such 
closure relations are either asymptotic schemes, involving much mathematical work but 
leading to an estimate of the errors involved, or some form of truncation scheme, in which 
higher order moments are arbitrarily assumed to vanish, but possibly resulting in a dras- 
tic approximation. In 1960 Morita and Hiroike [85] produced the formally exact closure 
formula, 



where $(r) is the pair potential. This reduced the task to finding the so called 'bridge' 
functional B[h(r)] (so named due to its diagrammatic similarity to the famous Wheatstone 
bridge) rather than the entire closure relation. However, the bridge functional represents 
an infinite sum of diagrams. Sadly, the bridge function is not known exactly even for 
the simplest model; one component hard spheres. Apart form the most simple cases (for 
example, for the Percus-Yevick closure [86] for hard spheres [87, 88] or mixtures thereof 
[89]) modern use of integral equations have to rely on computers to obtain iterative solutions 
[90-92] to the various approximate closures of the OZ equation. 

4 Progress 

In this section I shall focus on a small number of liquid systems that are currently bene- 
fitting, and will continue to benefit from, fast electronic calculating machines. The orig- 
inal simulation of 56 two-dimensional Lennard- Jones atoms was performed in 1953 [32] 
on the MANIAC machine at Los Alamos. In 2003 a simulation was performed, on the 
QSC machine again at Los Alamos, for a system of 19,000,416,964 splined Lennard-Jones 
sites [93]. In terms of a Moore's Law [94] like scaling, from these two data points the 
number of atoms that one can simulate seems to double roughly every 21 months. Thus any 
computational tour deforce mentioned here is perhaps by now passe. 

4.1 First Principles Molecular Dynamics 

First principles molecular dynamics, developed by Car and Parrinello [95, 96], involves 
solving the Kohn-Sham equations [97]. These are a set of non-linear, coupled integro- 
differential partial differential equations which make use of the Born-Oppenheimer approx- 
imation [98] (for a recent study of the validity of this approximation see [99]). Thus ions 
can be treated as classical particles obeying Newton's laws of motion. The potential is 
defined as 



where H(ri) is the many -body electronic Hamiltonian, and is the ground state eigen- 
f unction. The forces are then found using the Hellman-Feynman theorem [100], 



h(r) - 1 = exp (-/?$(r) + h(r) - c(r) + B[h{r)]) , 



(16) 



$(n) = <$o|W(ri)|*o> 



(17) 




(18) 
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One of the beauties of first-principles molecular dynamics (FPMD) is that it is able to sim- 
ulate chemical reactions, i.e. the making and breaking of chemical bonds, something not 
open to standard molecular dynamics techniques. On the other hand, one of the great hin- 
drances to FPMD is that the simulations scale as iV 3 where N is the number of ions. Here 
I should like to mention a recent simulation, not of a liquid but of a solid metal. I do this 
because of the impressive size of the system: 1000 atoms of the transition metal molyb- 
denum (Mo) [101] (a system comprising of 12,000 electrons) have been studied on the 
Blue Gene/L computer, currently the worlds fastest computer (having 131,072 processors). 
Another system, studied using pseudopotential theory interaction potentials, is the solidifi- 
cation of 32,768,000 atoms of the molten metal tantalum (Ta) [102], again using the Blue 
Gene/L computer. 

4.2 Liquid Crystals 

Liquid crystals are a phase of matter situated between the liquid and the solid phases. They 
flow as liquids do, but also present crystalline-like properties, such as birefringence (i.e. are 
anisotropic). Liquid crystals were discovered in 1888 by the biochemist Friedrich Reinitzer 
[103-105] whilst studying the compound cholesteryl benzoate obtained from carrots. This 
compound was further and extensively studied by Otto Lehmann [106]. In a classic treatise, 
in 1922, Georges Friedel recognised liquid crystals as comprising a new form of matter, and 
reluctantly coined new names for the then known liquid-crystalline, or mesomorphic phases; 
nematic (ur/fia, thread), smectic (afir/^/jLa, soap), and cholesteric [107, 108]. Nematic 
liquid crystals have orientational order, but lack positional order (see Fig. 2). Smectic 
phases form a layered structure, and although they too also have orientational order, they 
lack positional order within the layers. Today, liquid crystals are ubiquitous as the active 
component of a whole spectrum of display devices. 

One of the distinguishing feature of liquid crystals is their shape anisotropy. Lars On- 
sager [109] showed that (lyotropic) nematic phases form at vanishingly low concentrations 
for a solution of infinitely thin, infinitely long, rods. He derived an expression for the free 
energy in terms of an orientational distribution function, where the orientational and posi- 
tional contributions can be considered independently. In sacrificing orientational entropy 
(i.e. by aligning) many more positions are freed up. This competition eventually leads to an 
orientationally ordered phase as the concentration, or density, is increased. However, an un- 
derstanding of the liquid crystal phase for less idealised models had to wait for the advent of 
computer simulations. One of the first simulations was that of Vieillard-Baron [110], who, 
in 1972, studied the I-N transition for a system of 170 two-dimensional hard ellipses. A 
wonderful simulation was the growth of a smectic phase in 1988 by Frenkel, Lekkerkerker 
and Stroobants [111] for a system of 576 hard spherocylinders having a length to diameter 
ratio of 5. For the case of linear-tangential hard spheres, Monte Carlo simulations have 
produced smectic phases for m = 5 and both nematic and smectic phases for m > 6 [1 12] . 
The first demonstration of spontaneous liquid crystal phase formation for a 'realistic' meso- 
gen was shown by McBride, Wilson and Howard in 1998 [113] in a molecular dynamics 
simulation of the molecule 4,4'-di-n-pentyl-bibicyclo [2.2.2]octane. In 2001 the first thou- 
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Figure 2: Snapshot of a nematic liquid crystal phase from a Monte Carlo simulation of 
hard prolate (1x1x6) ellipsoids. 

sand molecule simulation of a mesogen at the fully atomistic level was performed [1 14] for 
the molecule PCH5. 

For two excellent reviews on liquid crystal simulations see Wilson [115] and Care and 
Cleaver [116]. 

4.3 Water 

Water has elicited sustained scientific interest for well over 100 years, notably the early 
work of Gustav Tammann and Percy Bridgman [117-121]. This is partly because of its 
'anomalous' physical properties, and partly because of its integral role in biological pro- 
cesses. Water still holds many surprises install, with ices XIII and XIV having only just 
been found in 2006 [122, 123]. Pioneering simulation work was performed by Barker Watts 
(Monte Carlo) [124] and Rahman and Stillinger (molecular dynamics) [125]. Since then a 
great many 'simple' models of water have been proposed. For an extensive review see Guil- 
lot [126]. The most popular models seem to be the 'four point transferable intermolecular 
potential' (TIP4P) [127] (with over 5500 citations at the time of writing), and the 'sim- 
ple point charge-extended' (SPC/E) model [128] (with nearly 2000 citations at the time of 
writing). The more recent TIP5P [129] is also attracting substantial interest. A wonderful 
example of the power of such seemingly simple models is shown when one calculates the 
water and ice phase diagram. This task was recently undertaken by the Vega group for 
the TIP4P and SPC/E models [130] using a combination of Monte Carlo simulations along 
with Gibbs-Duhem integration [131] to trace out the phase boundaries. The TIP4P model is 
able to make a fair representation of the experimental phase diagram. Not at all bad if one 
considers that the model was designed solely for the liquid phase. Water has been studied 
up to 70,000 K and 3.7 g/cm 3 using density functional theory [132], and it's structure has 



New Research on Computer Physics 



11 



been examined using FPMD (see §4.1) in the complete basis set limit [133] resulting in 
good agreement with experimental results obtained from neutron scattering. 

4.4 Biomolecular systems 

Biomolecular systems are composed of one or various macromolecules, often immersed 
in a solvent, such as water. However, their size is not the only obstacle to simulations 
of such systems. Proteins, in their denatured state adopt a more-or-less random configura- 
tion. Proteins in their natural state almost un-erringly adopt a unique conformation (tertiary 
structure). However, this is usually just one local minima of potentially a large number of 
minima. The essential question is how do these proteins repeatedly and reversibly 'fold' 
into their biologically active conformation. Given the size of proteins, an exploration of 
phase space in order to arrive at a minimum energy conformation would take an inordi- 
nate amount of time; however, protein folding is rapid (ps 100ns for a helices, 1/u.s for (5 
hairpins [134]) in an adequate (physiological) environment. This led to the idea of a path- 
way for protein folding. These questions were raised in two summaries written by Cyrus 
Levinthal [135, 136] and resulted in a Nobel Prize in chemistry for Christian Anfinsen [137] 
for his 'thermodynamic hypothesis'; the Gibbs free energy of the system is at a minimum 
in the natural state. 

A land-mark study was the 1 microsecond simulation of the 36-residue villin headpiece 
subdomain by Duan and Kollman [138, 139]. Although 1 microsecond was not sufficient 
to see the entire folding process (of the order of 10-100 /its) it demonstrated the feasibility 
of such simulations. Recently, an even smaller protein, an artificial peptide called chigno- 
lin, has been synthesised. It consists of only 10 amino acid residues [140]. This peptide 
has been the subject of a replica exchange molecular dynamics (REMD) simulation [141] 
leading to reproducible folding. Hubner, Deeds, and Shakhnovich have since developed a 
fully transferable potential for use in all-atom models [142] that results in reliable 'high- 
resolution' folding. It is also worth noting that although these biomolecules are very large, 
it is not unusual to dedicate a substantial part of the computation task to the simulation of 
the solvent [143]. 

All said and done, there are occasions where the incredibly complex folding process 
goes awry and the proteins misfold [144], either following their synthesis on the ribosome, 
or later, resulting in harmful isoforms known as prions [145, 146]. Misfolded proteins tend 
to form aggregates whose deposits can lead to devastating conditions such as Alzheimer's 
and Parkinsons diseases [147]. Thus a good understanding of how proteins fold is one of 
the most important present-day challenges. A testament to this importance is that IBM has 
built the worlds currently fastest supercomputer (the aforementioned Blue Gene/L machine) 
with these problems in mind. 

It is important to mention the Folding@Home distributed computing initiative [148]. 
This project makes use of the millions of computers in the world that are lying idle at any 
one point in time. Members of the public are free to download mini-simulations, run them 
as a screen-saver, and then return the completed calculations to Folding® Home. 
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5 The Future 

To prognosticate about the future is a notorious business. However, in the short term won- 
derful new insights into the liquid state are inevitable. 

The invention of the World Wide Web at the Conseil Europeen pour la Recherche 
Nucleaire (CERN) in 1989 revolutionised the distribution and communication of informa- 
tion. With over 90% of all the scientific papers ever published now available on-line, one 
can perform a comprehensive literature search in maybe less than an hour. 

Now, once again, CERN is the driving force behind another change in the way we 
work. The development of the biggest physics experiment ever, the Large Hadron Collider 
(LHC), will result in an incredible amount of data in need of analysis. It is estimated 
that when in operation the LHC will produce an overwhelming 15 million gigabytes of 
information per year. In order to process such a vast amount of data CERN is supporting 
the LHC Computing Grid and the Enabling Grids for E-sciencE projects. A GRID 
is, as its name suggests, a grid of computers connected via very high speed Internet 
connections (around one gigabyte/second). The idea is to share computing power and 
storage capacity on a global scale, thus effectively creating one vast computational resource. 

Data, usually encountered in a plethora of proprietary formats, can be difficult to analyse 
or visualise without specialised programs. It is likely that some format based on the W3C 
Extensible Markup Language (XML [149]) will allow greater access to this data, and will 
result in more efficient data management and analysis. 

On a local scale, open software, such as the Linux operating system, along with off-the- 
shelf hardware have allowed research groups to build mini-super computers at relatively 
little cost. A model for such systems are the so-called Beowulf clusters [150], using the 
Message Passing Interface (MPI) [151] calls for parallel computer programs. 

Simulations that can be performed independently, or 'concurrently' have found popu- 
lar support. The enthusiasm of the general public for science is amply demonstrated by 
distributed computing '@' projects, such as the hugely successful Folding ©Home, having 
access to nearly two million home computers with an estimated 200 TFLOPS of collective 
computing power at the time of writing. 

50 years on from the first computer simulations on MANIAC-I, Los Alamos National 
Laboratory will be host to the IBM 'Roadrunner' super-computer, with a projected sus- 
tained speed of 1 petaflop and a peak performance of 1.6 petaflops. Roadrunner will have 
over 32,000 Cell/AMD Opteron processors, a long way from the 1024 vacuum tubes of the 
MANIAC-I. 

Could 2007 be another key year for liquid state physics? 
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